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Abstract 

A combined search for the Standard Model Higgs boson with the ATLAS detector at the LHC 
is presented. The datasets used correspond to integrated luminosities from 4.6 fb 1 to 4.9 fb 1 of 
proton-proton collisions collected at y/s = 7 TeV in 201 1 . The Higgs boson mass ranges of 1 1 1 .4 GeV 
to 1 1 6.6 GeV, 119.4 GeV to 1 22. 1 GeV, and 129.2 GeV to 541 GeV are excluded at the 95% confidence 
level, while the range 120 GeV to 560 GeV is expected to be excluded in the absence of a signal. 
An excess of events is observed at Higgs boson mass hypotheses around 126 GeV with a local 
significance of 2.9 standard deviations (a). The global probability for the background to produce 
an excess at least as significant anywhere in the entire explored Higgs boson mass range of 110- 
600 GeV is estimated to be ~15%, corresponding to a significance of approximately la. 
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I. INTRODUCTION 

Probing the mechanism for electroweak symmetry 
breaking (EWSB) is one of the prime objectives of the 
Large Hadron Collider (LHC). In the Standard Model 
(SM) the electroweak interaction is described by a 

local gauge field theory with an SU(2) l ® U(l)y symme- 
try, and EWSB is achieved via the Higgs mechanism with 
a single SU(2) l doublet of complex scalar fields 0-0] • Af- 
ter EWSB the electroweak sector has massive W ± and Z 
bosons, a massless photon, and a massive CP-even, scalar 
boson, referred to as the Higgs boson. Fermion masses 
are generated from Yukawa interactions with couplings 
proportional to the masses of fermions. The mass of the 
Higgs boson, tuh, is a free parameter in the SM. However, 
for a given run hypothesis the cross sections of the vari- 
ous Higgs boson production processes and the branching 
fractions of the decay modes can be predicted, allowing a 
combined search with data from several search channels. 

Combined searches at the CERN LEP e + e~ collider 
excluded the production of a SM Higgs boson with mass 
below 114.4 GeV at 95% confidence level (CL) pj. The 
combined searches at the Fermilab Tevatron pp collider 
excluded the production of a SM Higgs boson with a mass 
between 147 GeV and 179 GeV, and between 100 GeV and 
106 GeV at 95% CL [ll|. Precision electroweak measure- 
ments are sensitive to mu via radiative corrections and 
indirectly constrain the SM Higgs boson mass to be less 
than 158 GeV at 95% CL. 

In 2011, the LHC delivered an integrated luminosity 
of 5.6 fb _1 of proton-proton (pp) collisions at a center- 
of-mass energy of 7 TeV to the ATLAS detector Of 
the 4.9 fb _1 collected, the integrated luminosity used in 
the individual Higgs search channels is between 4.6 fb _1 
and 4.9 fb _1 , depending on the data quality requirements 
specific to each channel. 

This paper presents a combined search for the SM 
Higgs boson in the decay modes H — > 77, H — > ZZ^*\ 
H -t WW^*\ H -> t + t~, and H -> bb, with subsequent 
decays of the W, Z, and r leading to different final states. 
Some searches are designed to exploit the features of the 
production modes pp — > H (gluon fusion), pp — > qqH 
(vector boson fusion) and pp —> VH with V = W or Z 



(associated production with a gauge boson). In order to 
enhance the search sensitivity, the various decay modes 
are further subdivided into sub-channels with different 
signal and background contributions and different sensi- 
tivities to systematic uncertainties. While the selection 
requirements for individual search channels are disjoint, 
each selection is, in general, populated by more than one 
combination of Higgs boson production and decay. For 
instance, Higgs boson production initiated by vector bo- 
son fusion (VBF) can contribute significantly to a search 
channel optimized for gluon fusion production. 

The ATLAS collaboration has previously published a 
similar but less extensive combined search for the Higgs 
boson [14] in data taken at the LHC in 2011. The CMS 
collaboration has also performed a combined analysis of 
Higgs searches with data collected in 2011 and have ob- 
tained similar results [l5| . In comparison to the analy- 
sis of Ref. [lj], the H — > t + t~ and H — » bb channels 
have been added, the H -> WW^> -> l + vl~V analy- 
sis has been updated and extended to cover the mass 
range range of 110-600 GeV, and the H -> WW -> ivqq 1 
H -> ZZ -> l+l-vv, and H -> ZZ -> £+£-qq analyses 
have been updated to use the full 2011 dataset. Both the 
H ->• WW^ -> i+isl-V and H -+ WW -> ivqq' anal- 
yses include a specific treatment of the 2-jets final state 
targeted at the VBF production process. 

The different channels entering the combination are 
summarized in Table |TJ After describing the general ap- 
proach to statistical modeling in Section |Hl the indi- 
vidual channels and the specific systematic uncertainties 
are described in Section Hill and Section ITVl respectively. 
The statistical procedure is described in Section [V] and 
the resulting exclusion limits and compatibility with the 
background-only hypothesis are presented in Section IVII 
and Section IVTIl respectively. 



II. STATISTICAL MODELING 

In this combined analysis, a given search channel, in- 
dexed by c, is defined by its associated event selection 
criteria, which may select events from various physical 
processes. In addition to the number of selected events, 
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TABLE I. Summary of the individual channels entering the combination. The transition points between separately optimized 
ran regions are indicated when applicable. The symbols ® and © represent direct products or sums over sets of selection 
requirements. The details of the sub-channels are given in Section ULTl 
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n, each channel may make use of an invariant or trans- 
verse mass distribution of the Higgs boson candidates. 
The discriminating variable is denoted x and its proba- 
bility density function (pdf) is written as f(x\a), where 
a represents both theoretical parameters such as ran and 
nuisance parameters associated with various systematic 
effects. These distributions are normalized to unit prob- 
ability. The predicted number of events satisfying the 
selection requirements is parameterized as v{ol). For 
a channel with n selected events, the data consist of 
the values of the discriminating variables for each event 
T> = {x\, . . . , x n }. The probability model for this type of 
data is referred to as an unbinned extended likelihood or 
marked Poisson model f , given by 



The combined model can then be written as follows 



f(V\a) =Pofe(n|i/(a)) JJ/0> 



(1) 



e=l 



For each channel several signal and background scat- 
tering processes contribute to the total rate v and the 
overall pdf f(x\a). Here, the term process is used for 
any set of scattering processes that can be added inco- 
herently. The total rate is the sum of the individual rates 

!/(€*)= J2 ( 2 ) 

fc(E processes 

and the total pdf is the weighted sum 

/N«) = -7—r Y] v k (a)f k (x\a) . (3) 
via) 

/cGproccsses 



fcoro(f com | ") = Yl 



Pois(n c |^ c (a)) f c (x ce \a) 



(4) 



Parameterization of the Model 



The parameter of interest is the overall signal strength 
factor /i, which acts as a scale factor to the total rate of 
signal events. This global factor is used for all pairings 
of production cross sections and branching ratios. The 
signal strength is defined such that fi = corresponds to 
the background-only model and fi = 1 corresponds to the 
SM Higgs boson signal. It is convenient to separate the 
full list of parameters a into the parameter of interest /i, 
the Higgs boson mass run, and the nuisance parameters 
6, i.e. a — (^,m//,0). 

Each channel in the combined model uses either the 
reconstructed transverse mass or the invariant mass of 
the Higgs candidate as a discriminating variable. Two 
approaches are adopted to model the signal pdfs at in- 
termediate values of ran where full simulation has not 
been performed. The first, used in the H — > 77 chan- 
nel, is based on a continuous parameterization of the sig- 
nal as a function of run using an analytical expression 
for the pdf validated with simulated Monte Carlo (MC) 
samples. The second, used in channels where pdfs are 
modeled with histograms, is based on an interpolation 
procedure using the algorithm of Ref. [24| . 



Using e as the index over the n c events in the c chan- 
nel, x ce is the value of the observable x for the e th event 
in channels 1 to c max . The total data are a collection of 
data from individual channels: T> com — {2?i, . . . ,£> Cmax }- 



B. Auxiliary Measurements 

The nuisance parameters represent uncertain aspects 
of the model, such as the background normalization, re- 
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construction efficiencies, energy scale and resolution, lu- 
minosity, and theoretical predictions. These nuisance 
parameters are often estimated from auxiliary measure- 
ments, such as control regions, sidebands, or dedicated 
calibration measurements. A detailed account of these 
measurements is beyond the scope of this paper and is 
given in the references for the individual channels [l6l - |23l | . 

Each parameter a p with a dedicated auxiliary mea- 
surement faux C^aux | CK P , Mother) provides a maximum like- 
lihood estimate for and a standard error (T„. Thus, 
the detailed probability model for an auxiliary measure- 
ment is approximated as 



faux(^aux I &p > ^othcr ) ^ fpi^p I ^pi ^p) 



(5) 



The f p (a p \a p , a p ) are referred to as constraint terms. 

The fully frequentist procedure applied for the present 
analysis includes randomizing the a p when constructing 
the ensemble of possible experiment outcomes. In the 
hybrid frequentist-Bayesian procedures used at LEP and 
the Tevatron, the a p are held constant and the nuisance 
parameters a p are randomized according to the prior 
probability density 



n(a p \a p ) oc f(a p \a p , a p )r](a p ) 



(6) 



where r)(a p ) is an original prior, usually taken to be con- 
stant. 

The set of nuisance parameters constrained by auxil- 
iary measurements is denoted § and the set of estimates 
of those parameters, also referred to as global observables 
which augments T> com , is denoted Q = {a p } with p e S. 
Including the constraint terms explicitly, the model can 
be rewritten to 



ItotC Z ?com,£|«) = Yi 



Pois(n c |^ c (a)) f c (x ce \a) 



pes 



(7) 



The use of a Gaussian constraint term f p (a p \a p , a p ) = 
Gauss(a p |a p , a p ) is problematic if the parameter is intrin- 
sically non-negative, as is the case for event yields and 
energy scale uncertainties. This is particularly impor- 
tant when the relative uncertainty is large. An alterna- 
tive constraint term defined only for positive parameter 
values is the log-normal distribution, which is given by 



f p (a p \a p ) 



1 



1 



2tt In n a p 



exp 



(\n(a p /a p )f 



(8) 



The conventional choice k = 1 + oVd is made, where 
cr ro i is the relative uncertainty a p /a p from the observed 
auxiliary measurement [25J. 

Using the log-normal distribution for a p is equivalent 
to having a Gaussian constraint for the transformed pa- 
rameter a' — In Or, and a' = lna„. 



For channels that use histograms based on simulated 
MC samples, the parametric pdf f(x\a) is formed by 
interpolating between histogram variations evaluated at 
a p = a p ±a p . Since the variations need not be symmetric, 
the function is treated in a piecewise way using a sixth- 
order polynomial to interpolate in the range a p € [a p — 
tjp, a p + a p ] with coefficients chosen to match the first and 
second derivatives [261] . Henceforth, the prime will be 
suppressed and a p will refer to the transformed nuisance 
parameter. 

Not all systematic uncertainties have an associated 
auxiliary measurement. For example, uncertainties as- 
sociated with the choice of renormalization and factor- 
ization scales and missing higher-order corrections in a 
theoretical calculation are not statistical in nature. In 
these cases, the frequentist form of the constraint term 
is derived assuming, by convention, a log-normal prior 
probability density on these parameters and inverting 
Eq. ©. 



III. INDIVIDUAL SEARCH CHANNELS 

All the channels combined to search for the SM Higgs 
boson use the complete 2011 dataset passing the relevant 
quality requirements. The Higgs boson decays considered 
areff -> 77, H -> WW { *\ H ->■ ZZ^*\ H -> r+r", and 
H — > bb. In modes with a W or Z boson, an electron or 
muon is required for triggering. In the H — > t + t~ chan- 
nel, almost all combinations of subsequent r decays are 
considered. The results in the 77 and £ + £~£ + £~ modes 
are the same as in the previously published combina- 
tion [l4j, but all other channels have been updated. A 
summary of the individual channels contributing to this 
combination is given in Table HI 

The invariant and transverse mass distributions for the 
individual channels are shown in Figs. [T] and [21 with sev- 
eral sub-channels merged. 

• H — > 77: This analysis is unchanged with respect 
to the previous combined search [14l . Il6l ] and is car- 
ried out for ran hypotheses between 110 GeV and 
150 GeV. Events are separated into nine indepen- 
dent categories of varying sensitivity. The catego- 
rization is based on the pseudorapidity of each pho- 
ton, whether it was reconstructed as a converted 
or unconverted photon, and the momentum com- 
ponent of the diphoton system transverse to the 
diphoton thrust axis (prt)- The mass resolution is 
approximately 1.7% for m#~120 GeV. 
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FIG. 1. Invariant or transverse mass distributions for the selected candidate events, the total background and the signal expected 
in the following channels: (a) H -» 77, (b) H -> ZZ { * ] -> £+£'£+£- in the entire mass range, (c) H ->■ ZZ^ -> t+Tl+T in 
the low mass range, (d) H — > ZZ — > £ + £~vv, (e) b-tagged selection and (f) untagged selection for H — > ZZ — > £ + £~qq, (g) H — > 
WW M -> £ + v£-V+0-jets, (h) H ->■ WW M ->■ i+vrv+l-jet, (i) H -> WW (t) -» l + v£~V+2-iets, (j) H -» WW ->■ £uqq+Q- 
jets, (k) H -> WW £vqq'+l-iet and (1) H -> WW -> £vqq' +2-jets. The H -> WW {t) -> £ + vrv+2-]ets distribution is 
shown before the final selection requirements are applied. 
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FIG. 2. Invariant or transverse mass distributions for the selected candidate events, the total background and the signal expected 
in the following channels: (a) H — > ri cp ri cp +0-jets, (b) H — > T\ cp T\ cp 1-jet, (c) H — > ri cp ri cp +2-jets, (d) H — > Ti cp Th a d+0-jets and 
1-jet, (e) H — > riopThad+2-jets, (f) H —> r had r had . The bb invariant mass for (g) the ZH — > £ + £~bb, (h) the WH — > Ivbb and (i) 
the ZH — > vvbb channels. The vertical dashed lines illustrate the separation between the mass spectra of the subcategories in 
Pt i Pt i an d E™ 1BB , respectively. The signal distributions are lightly shaded where they have been scaled by a factor of five or 
ten for illustration purposes. 
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• H -» ZZ^ : In the ZZ^ decay mode at least one 
Z is required to decay to charged leptons, while the 
other decays to either leptons, neutrinos, or jets. 

- H ->■ ZZ^ ->■ £+£-£+£-: This analysis, de- 
scribed in Ref. [l7| . is performed for ran hypothe- 
ses in the 110 GeV to 600 GeV mass range and 
is unchanged with respect to the previous com- 
bined search [ljj]. The main irreducible ZZ^ back- 
ground is estimated using a combination of Monte 
Carlo simulation and the observed data. The re- 
ducible Z+jets background, which mostly impacts 
the low four-lepton invariant mass region, is esti- 
mated from control regions in the data. The top- 
quark (tt) background normalization is validated 
using a dedicated control sample. Four categories 
of events are defined by the lepton flavor combina- 
tions and the four-lepton invariant mass is used as 
a discriminating variable. The mass resolution is 
approximately 1.5% in the four-muon channel and 
2% in the four-electron channel for m#~120 GeV. 

- H —> ZZ —> £ + l~vv: This analysis [l8j is split 
into two regimes according to the level of pile-up, 
i.e. the average number of pp collisions per bunch 
crossing. The first 2.3 fb _1 of data had an aver- 
age of about six pile-up collisions per event and 
the subsequent 2.4 fb" 1 had an average of about 
twelve. The search is performed for tuh hypothe- 
ses ranging from 200 GeV to 600 GeV. The analysis 
is further categorized by the flavor of the leptons 
from the Z decay. The selection is optimized sep- 
arately for Higgs boson masses above and below 
280 GeV. The £ + £~ invariant mass is required to be 
within 15 GeV of the Z boson mass. The inverted 
requirement is applied to same-flavor leptons in the 
H — > WW^ — > £ + v£~V channel to avoid overlap 
in the selection. The transverse mass (ttit), com- 
puted from the dilepton transverse momentum and 
the missing transverse momentum, is used as a dis- 
criminating variable. 

- H — > ZZ — s- £ + £~qq: This search is performed for 
ran hypotheses ranging from 200 GeV to 600 GeV 
and is separated into search regions above and be- 
low mn—iOO GeV, for which the event selections 
are independently optimized. The dominant back- 
ground arises from Z+jets production, which is es- 
timated using sidebands of the dijet invariant mass 
distribution in data. To profit from the relatively 
large rate of 6-jets from Z boson decays present 
in the signal compared to the rate of 6-jets found 
in the Z+jets background, the analysis is divided 
into two categories. The first category contains 
events in which the two jets are 6-tagged and the 
second uses events with less than two 6-tags. The 
analysis [l9[ takes advantage of a highly efficient 
6-tagging algorithm [27| and the sideband to con- 
strain the background yield. Using the Z boson 
mass constraint improves the mass resolution of the 



l + (rqq system by more than a factor of two. The 
invariant mass of the £ + £~qq system is used as a 
discriminating variable. 

• H — >• WW^*': Two sets of channels are devoted 
to the decay of the Higgs boson into a pair of W 
bosons, namely the £ + v£~V and £vqq' channels. 

- H -> WW^ -> £ + v£-V: The updated analy- 
sis [13] is performed for mn values from 110 GeV 
up to 600 GeV. Events with two leptons are clas- 
sified by the number of associated jets (0, 1 or 2), 
where the two-jet category has selection criteria de- 
signed to enhance sensitivity to the VBF produc- 
tion process. The events are further divided by 
the flavors of the charged leptons, ee, e/i and mi 
where the mixed mode (e/i) has a much smaller 
background from the Drell-Yan process. As in the 
case of H — » ZZ — > £ + £~vv, the samples are split 
according to the pile-up conditions and analyzed 
separately. Each sub-channel uses the WW trans- 
verse mass distribution, except for the 2-jets cate- 
gory, which does not use a discriminating variable. 

- H — » WW — > £vqq': This analysis is per- 
formed for niH hypotheses ranging from 300 GeV 
to 600 GeV. A leptonically decaying W boson is 
tagged with an isolated lepton and missing trans- 
verse momentum (_E™ ISS ). Additionally, two jets 
with an invariant mass compatible with a second 
W boson [21] are required. The W boson mass 
constraint allows the reconstruction of the Higgs 
boson candidate mass on an event-by-event basis 
by using a quadratic equation to solve for the com- 
ponent of the neutrino momentum along the beam 
axis. Events where this equation has imaginary 
solutions are discarded in order to reduce tails in 
the mass distribution. The analysis searches for 
a peak in the reconstructed £vqq' mass distribu- 
tion. The background is modeled with a smooth 
function. The analysis is further divided by lepton 
flavor and by the number of additional jets (0, 1 or 
2), where the two jet channel is optimized for the 
VBF production process. 

• H — s- t + t~: The analyses [H| are categorized by 
the decay modes of the two t leptons, for m# hy- 
potheses ranging from 110 GeV to 150 GeV (the 
leptonically decaying r leptons are denoted ri ep and 
the hadronically decaying r leptons are denoted 
Thad)- Most of these sub-channels are triggered us- 
ing leptons, except for the fully hadronic channel 
H ThadThad; which is triggered with specific dou- 
ble hadronic r decay selections. All the searches 
using r decay modes have a significant background 
from Z — > t + t~ decays, which are modeled using 
an embedding technique where Z — > fi + /i~ candi- 
dates selected in the data have the muons replaced 
by simulated r decays [22| ■ These embedded events 
are used to describe this background process. 
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- H — > ricpTicp! In this channel events are sepa- 
rately analyzed in four disjoint categories based on 
the number of reconstructed jets in the event [HJ . 
There are two categories specifically aimed towards 
the gluon fusion production process, with or with- 
out a jet, one for the VBF production process and 
one for the Higgs boson production in association 
with a hadronically decaying vector boson. Each 
jet category requires at least one jet with px above 
40 GeV. The collinear approximation [28] is used to 
reconstruct the rr invariant mass, which is used as 
the discriminating variable. All three combinations 
of e and p, are used, except in the 0-jets category, 
which uses only the e/z candidate events where the 
effective mass is used as a discriminating variable. 

- H — > TicpThad^ There are seven separate cate- 
gories in this sub-channel. The selection of VBF- 
like events requires two jets with oppositely signed 
pseudorapidities rj, \^-f]jj\ > 3.0 and a dijet in- 
variant mass larger than 300 GeV, in which events 
with electrons and muons are combined due to the 
limited number of candidates. In the other sub- 
channels, electron and muon final states are con- 
sidered separately. The remaining candidate events 
are categorized according to the number of jets 
with transverse momenta in excess of 25 GeV, the 
0-jets category being further subdivided based on 
whether the E™ iss exceeds 20 GeV or not. The 
Missing Mass Calculator (MMC) technique [II] is 
used to estimate the rr invariant mass, which is 
used as a discriminating variable. 

- H + jet — > ThadThad + jet: Events are triggered 
using a selection of two hadronically decaying r 
leptons with transverse energy thresholds varying 
according to the running conditions (22j . Two 
oppositely-charged hadronically decaying r candi- 
dates are required along with one jet with trans- 
verse momentum larger than 40 GeV, E™ lss > 
20 GeV and a reconstructed invariant mass of the 
two r leptons and the jet greater than 225 GeV. 
In addition to the Z background there is a signifi- 
cant multijet background which is estimated using 
data-driven methods. The tt invariant mass is es- 
timated via the collinear approximation and is used 
as a discriminating variable after further selections 
on the momentum fractions carried away by visible 
r decay products. 

• H -> bb: The ZH -> t + l~bb, ZH ->■ vvbb, and 
WH — > Ivbb analyses [23| are performed for to# 
ranging from 110 GeV to 130 GeV. All three analy- 
ses require two 6-tagged jets (one withpx > 45 GeV 
and the other with px > 25 GeV) and the invariant 
mass of the two 6-jets, rribb, is used as a discriminat- 
ing variable. The ZH —> £ + £~bb analysis requires 
a dilepton invariant mass in the range 83 GeV< 
m u < 99 GeV and £ T liss < 50 GeV to suppress 
the tt background. The WH —> Ivbb analysis re- 



quires E™ lss > 25 GeV, the transverse mass of 
the lepton- E^ 1 " 38 system to be in excess of 40 GeV, 
and no additional leptons with px > 20 GeV. The 
ZH — > vvbb analysis requires E™ lss > 120 GeV, as 
well as p^ lss > 30 GeV, where p™ lss is the missing 
transverse momentum determined from the tracks 
associated with the primary vertex. To increase 
the sensitivity of the search, the rribb distribution is 
examined in sub-channels with different signal-to- 
background ratios. In the searches with one or two 
charged leptons, the division is made according to 
four bins in transverse momentum p^ of the recon- 
structed vector boson V: p^ < 50 GeV, 50 GeV < 
p% < 100 GeV, 100 GeV < y\ < 200 GeV and 
p^ > 200 GeV. In the ZH -> vvbb search the Ef iss 
is used to define three sub-channels correspond- 
ing to 120 GeV < £ T liss < 160 GeV, 160 GeV < 
E™ [ss < 200 GeV, and E™ iss > 200 GeV. No cate- 
gorization is made based on lepton flavor. 



IV. SYSTEMATIC UNCERTAINTIES 

The sources of systematic uncertainties and their ef- 
fects on the signal and background rates Vk{ct) and dis- 



ci) are described 
The sources 



criminating variable distributions fk(x 
in detail for each channel in Refs. [1 
of systematic uncertainty are decomposed into uncorre- 
cted components, such that the constraint terms factor- 
ize as in Eq. ([7]). The main focus of the combination of 
channels is the correlated effect of given sources of uncer- 
tainties across channels. Typically, the correlated effects 
arise from the ingredients common to several channels, 
for example the simulation, the lepton and photon iden- 
tification, and the integrated luminosity. The sources of 
systematic uncertainty affecting the signal model are fre- 
quently different from those affecting the backgrounds, 
which are often estimated from control regions in the 
data. The dominant uncertainties giving rise to corre- 
lated effects are those associated with theoretical predic- 
tions for the signal production cross sections and decay 
branching fractions, as well as those related to detector 
response affecting the reconstruction of electrons, pho- 
tons, muons, jets, _E™ 1SS and 6-tagging. The log-normal 
constraint terms are used for uncertainties in the sig- 
nal and background normalizations, while Gaussian con- 
straints are used for uncertainties affecting the shapes of 
the pdfs. 



A. Theoretical Uncertainties Affecting the Signal 

The Higgs boson production cross sections are com- 
puted up to next-to-next-to-leading order (NNLO) (30T - 
l35j in QCD for the gluon fusion (gg — > H) process, in- 
cluding soft-gluon resummation up to next-to-next-to- 
leading log (NNLL) (3(| and next-to-leading-order (NLO) 
electroweak corrections [37| |38j]. These predictions are 
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compiled in Refs. [39H41|. The cross section for the 
VBF process is estimated at NLO [42T - |44j and approx- 
imate NNLO QCD [45]. The cross sections for the as- 
sociated production processes (qq — > WH/ZH) are com- 
puted at NLO [MElI, NNLO Q QCD and NLO elec- 
troweak j42|. The cross section for the associated pro- 
duction with a tt pair (qq/ 1 gg — ¥ ttH) are estimated at 
NLO [49145 3j . The Higgs bos on p roduction cross sections 
and decay branching ratios [54l458| . as well as their re- 
lated uncertainties, are compiled in Ref. [5!|. The QCD 
scale uncertainties for m#=120 GeV amount to ^g 2 % for 
the gg — » H process, ±1% for the qq' — > qq' H and asso- 
ciated WH/ZH processes, and for the qq/gg — > ttH 
process. The uncertainties related to the parton distri- 
bution functions (PDF) amount to ±8% for the predom- 
inantly gluon-initiated processes gg — > H and gg — > ttH , 
and ±4% for the predominantly quark-initiated processes 
qq' -> qq'H and WH/ZH processes HO]. The theo- 
retical uncertainty associated with the exclusive Higgs 
boson production process with one additional jet in the 
H -> WW^> £+vl~V channel amounts to ±20% and 
is treated according to the prescription of Refs. [25L l6lj. 
An additional theoretical uncertainty on the signal nor- 
malization, to account for effects related to off-shell Higgs 
boson production and interference with other SM pro- 
cesses, is assigned at high Higgs boson masses (mg > 
300 GeV) and estimated as ±150%x (m H /TeV) 3 [E0464|. 



B. Theoretical Uncertainties Affecting the 
Background 

In the H — > 77 and H — > WW — > £vqq' channels the 
backgrounds are estimated from a fit to the data. This 
removes almost all sensitivity to the corresponding the- 
oretical uncertainties. In the case of the H — ¥ 77 analy- 
sis an additional uncertainty is assigned to take into ac- 
count possible inadequacies of the analytical background 
model chosen. Theoretical uncertainties enter in all other 
channels where theoretical calculations are used for back- 
ground estimates. In particular, both signal and back- 
ground processes are sensitive to the parton distribution 
functions, the underlying event simulation, and the par- 
ton shower model. 

The ZZ^ continuum process is the main background 
for the H -> ZZ& -> £+£-£+£- and H -> ZZ -> 
£ + £~vv analyses and is also part of the backgrounds in 
the H -> ZZ -> £ + £-qq channel. An NLO prediction [65| 
is used for the normalization. The QCD scale uncertainty 
has a ±5% effect on the expected ZZ^ background, 
and the effect due to the PDF and as uncertainties is 
±4% and ±8% for quark-initiated and gluon-initiated 
processes respectively. An additional theoretical uncer- 
tainty of ±10% on the inclusive ZZ^*> cross section is 
conservatively included due to the missing higher-order 
QCD corrections for the gluon-initiated process. This 
theoretical uncertainty is treated as uncorrelated for the 
different channels due to the different acceptance in the 



H ZZM £+£-£+£- and H -> ZZ £+£-vv chan- 
nels and because its contribution to the H —> ZZ —> 
£ + £rqq channel is small. 

In most other channels the overall normalization of the 
main backgrounds is not estimated from theoretical pre- 
dictions; however, simulations are used to model the pdfs 
f(x\a) or the scale factors used to extrapolate from the 
control regions to the signal regions. For example, in the 
H -> WW^ -> l+vl~V channel the main backgrounds 
are continuum WW^*' and tt production. Their normal- 
izations are estimated in control regions; however, the 
factors used to extrapolate to the signal region are esti- 
mated with the NLO simulation 66] . 



C. Experimental Uncertainties 

The uncertainty on the integrated luminosity is con- 
sidered as being fully correlated among channels and 
amounts to ±3.9% [63, HI]. 

The detector-related sources of systematic uncertainty 
can affect various aspects of the analysis: (a) the overall 
normalization of the signal or background, (b) the migra- 
tion of events between categories and (c) the shape of the 
discriminating variable distributions f(x\a). Similarly to 
the theoretical uncertainties, experimental uncertainties 
on the event yields (a) are treated using a log-normal 
f p (a p \a p ) constraint pdf. In cases (b) and (c) a Gaussian 
constraint is applied. 

The experimental sources of systematic uncertainty are 
modeled using the classification detailed below. Their 
effect on the signal and background yields in each channel 
separately is reported in Table |TTJ The various sources of 
systematic uncertainty have in some cases been grouped 
for a concise presentation (e.g. the jet energy scale and 
^-tagging efficiencies), while the full statistical model of 
the data provides a more detailed account of the various 
systematic effects including the effect on the pdfs f(x\a). 
The assumptions made in the treatment of systematics 
are outlined below. 

• The uncertainty in the trigger and identification ef- 
ficiencies are treated as fully correlated for electrons 
and photons. The energy scale and resolution for 
photons and electrons are treated as uncorrelated 
sources of uncertainty. 

• The uncertainties affecting muons are separated 
into those related to the inner detector (ID) and 
the muon spectrometer (MS) in order to provide 
a better description of the correlated effect among 
channels using different muon identification criteria 
and different ranges of muon transverse momenta. 

• The Jet Energy Scale and Jet Energy Resolution 
are sensitive to a number of uncertain quantities, 
which depend on pr, rj, and flavor of the jet. Mea- 
surements of the JES and JER result in compli- 
cated correlations among these components. Build- 
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TABLE II. A summary of the main correlated experimental systematic uncertainties. The uncorrelated systematic uncertainties 
are summarized in a single combined number. The uncertainties indicate the ±lcr relative variation in the signal and background 
yields in (%). The signal corresponds to a Higgs boson mass hypothesis of 125 GeV except for H — > ZZ — > £ + £~qq, H — > 
ZZ -> l + l~i>V and H -)■ WW -> ivqq' , which are quoted at 350 GeV. 
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ing a complete model of the response to these cor- 
related sources of uncertainty is intricate. Here, 
a simplified scheme is used in which independent 
JES and JER nuisance parameters are associated to 
channels with significantly different kinematic re- 
quirements and scattering processes with different 
kinematic distributions or flavor composition. This 
scheme includes a specific treatment for 6-jets. The 
sensitivity of the results to various assumptions in 
the correlation between these sources of uncertainty 
has been found to be negligible. Furthermore, an 
additional component to the uncertainty in _E™ SS , 



which is uncorrelated with the JES uncertainty, is 
included. 

• While the r energy scale uncertainty is expected 
to be partially correlated with the JES, here it is 
treated as an uncorrelated source of uncertainty. 
This choice is based on the largely degenerate ef- 
fect due to the uncertainty associated with the em- 
bedding procedure, in which the simulated detec- 
tor response to hadronic r decays is merged with 
a sample of Z — > fi + /i~ data events. Further- 
more, the uncertainty of this embedding procedure 
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is treated separately for signal and background pro- 
cesses, which is a conservative approach given that 
Z — > t+t - sideband effectively constrains this nui- 
sance parameter. 

• The 6-tagging systematic uncertainty is decom- 
posed into five fundamental sources in the H — > bb 
channels, while a simplified model with a single 
source is used in the H — » ZZ — > £ + £~qq chan- 
nel. The uncertainty in the &-veto is considered 
uncorrelated with the uncertainty in the 6-tagging 
efficiency. 

The effect of these systematic uncertainties depends 
on the final state, but is typically small compared to the 
theoretical uncertainty of the production cross section. 

The electron and muon energy scales are directly con- 
strained by Z — > e + e~ and Z — > /J + /i~ events; the impact 
of the resulting systematic uncertainty on the four-lepton 
invariant mass is of the order of ±0.5% for electrons and 
negligible for muons. The impact of the photon energy 
scale systematic uncertainty on the diphoton invariant 
mass is approximately ±0.6%. 



D. Background Measurement Uncertainty 

The estimates of background normalizations and 
model parameters from control regions or sidebands are 
the main remaining source of uncertainty. Because of the 
differences in control regions these uncertainties are not 
correlated across channels. 

In the case of the H — s- bb channels the background nor- 
malizations are constrained both from sideband fits and 
from auxiliary measurements based on the MC predic- 
tion of the main background processes (Z±jets, IV+jcts 
and tt). 

The uncorrelated sources of systematic uncertainties 
are summarized as a single combined number in Table [TT] 
for each channel. 



E. Summary of the Combined Model 

To cover the search range efficiently between 
rn#=110 GeV and m#=600 GeV, the signal and back- 
grounds are modeled and the combination performed in 
mn steps that reflect the interplay between the invariant 
mass resolution and the natural width of the Higgs boson. 
In the low mass range, where the high mass-resolution 
H -> 77 and H -> ZZW £+£-£+£' channels dom- 
inate, the signal is modeled in steps from 500 MeV to 
2 GeV. For higher masses the combination is performed 
with step sizes ranging from 2 GeV to 20 GeV. The ttih 
step sizes are given in Table IIIII 

The combined model and statistical procedure are 
implemented within the RooFit, HistFactory, and 
RooStats software framework (2(| H^, [7(| ■ As shown in 



TABLE III. Step sizes in Higgs boson mass hypotheses at 
which the signal and backgrounds are modeled. 



run [GeV] Step size 

110-120 1 GeV 

120-130 0.5 GeV 
130-150 1 GeV 
150-290 2 GeV 
290-350 5 GeV 

350-400 10 GeV 

400-600 20 GeV 



Table UJ the number of channels included in the combi- 
nation depends on the hypothesized value of mn- The 
details for the number of channels, nuisance parameters, 
and constraint terms for various m# ranges are shown 
in Table IIVI For m# = 125 GeV there are 70 channels 
included in the combined statistical model and the asso- 
ciated dataset is comprised of more than 22,000 unbinned 
events and 8,000 bins. For ran— 350 GeV there are 46 
channels included in the combined statistical model and 
the associated dataset is comprised only of binned distri- 
butions, for which there are more than 4,000 bins. Due 
to the limited size of the MC samples, additional nui- 
sance parameters and constraint terms are included in 
the model to account for the statistical uncertainty in 
the MC templates. The difference between the number 
of nuisance parameters and the number of constraints 
reported in Table IIVI corresponds to the number of nui- 
sance parameters without external constraints, which are 
estimated in data control regions or sidebands. 

TABLE IV. Details of the combined model for different mn 
ranges. The table shows the number of channels, nuisance pa- 
rameters associated with systematic uncertainties, number of 
constraint terms, and number of additional nuisance param- 
eters and constraints associated to limited MC sample sizes. 



rim [GeV] Channels Nuisance Constraints MC stat 
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46 
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112 
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46 


201 


111 


112 



V. STATISTICAL PROCEDURES 

The procedures for computing frequentist p-values 
used for quantifying the agreement of the data with the 
background-only hypothesis and for determining exclu- 
sion limits are based on the profile likelihood ratio test 



11 



statistic. 

For a given dataset 2? com and values for the global 
observables Q there is an associated likelihood function 
of /i and 9 derived from the combined model over all 
channels including all constraint terms in Eq. ([7)1 

L(fi,d;m H ,T> com ,g) = f to t(£>com, G\n, m H ,9) . (9) 

The notation L(/i, 9) leaves the dependence on the data 
implicit. 

A. The Test Statistics and Estimators of p, and 9 



signal events being produced at a lesser rate // < li: 

f-21nA(/z) A < m, 
q^ = \ - x (13) 

Again, a sign change is introduced in order to probe 
p- values larger than 50%. The test statistic — 21nA(/z) 
is used to differentiate signal events being produced at 
a rate li from the alternative hypothesis of signal events 
being produced at a different rate \j! ^ \i. 

Tests of ll are carried out with the Higgs mass run 
fixed to a particular value, and the entire procedure is 
repeated for values of tuh spaced in small steps. 



The statistics used to test different values of the 
strength parameter fi are defined in terms of a likeli- 
hood function L(fi,9). The maximum likelihood esti- 
mates (MLEs) fi and 9 are the values of the parameters 
that maximize the likelihood function L(fi,9). The con- 
ditional maximum likelihood estimate (CMLE) 9{li) is 
the value of 9 that maximizes the likelihood function 
with (x fixed. The tests are based on the profile likeli- 
hood ratio A(/i), which reflects the level of compatibility 
between the data and ll. It is defined as 



L(£,0) 



(10) 



Physically, the rate of signal events is non-negative, 
thus /j. > 0. However, it is convenient to define the esti- 
mator p as the value of /i that maximizes the likelihood, 
even if is negative (as long as the pdf f c (x c \p,,9) > 
everywhere). In particular, p < indicates a deficit 
of events with respect to the background in the signal 
region. Following Ref. [7l[ a treatment equivalent to re- 
quiring fi > is to allow /i < and impose the constraint 
in the test statistic itself, i.e. 



A(/i) 



L(oi(0)) 



A>o, 



p < 0. 



(11) 



To quantify the significance of an excess, the test statis- 
tic qa is used to test the background-only hypothesis 
ii = against the alternative hypothesis /i > 0. It is 
defined as 



3o 



'-21nA(0) p > 0, 
+21nA(0) £< 0. 



(12) 



Instead of defining the test statistic to be identically zero 
for p < as in Ref. [7l|, this sign change is introduced 
in order to probe p-values larger than 50%. 

For setting an upper limit on //, the test statistic 
is used to test the hypothesis of signal events being pro- 
duced at a rate li against the alternative hypothesis of 



B. The Distribution of the Test Statistic and 
p- values 

When calculating upper limits, a range of values of ll 
is explored using the test statistic q^. The value of the 
test statistic for the observed data is denoted q^obs- One 
can construct the distribution of qy, assuming a different 
value of the signal strength //, which is denoted 



f{q^\fi' \m H ,8) 



(14) 



The distribution depends explicitly on mn and 8. The 
p- value is given by the tail probability of this distribution, 
and thus the p- value will also depend on to# and 9. The 
reason for choosing the test statistic based on the profile 
likelihood ratio is that, with sufficiently large numbers 
of events, the distribution of the profile likelihood ratio 
with = pi is independent of the values of the nuisance 
parameters and, thus also the associated p-values. 

In practice, there is generally some residual depen- 
dence of the p-values on the value of 9. The values of 
the nuisance parameters that maximize the p-value are 
therefore sought. Following Refs. [25l . [72l - l75| . the v- values 
for testing a particular value of li are based on the dis- 
tribution constructed at 0(^,obs), the CMLE estimated 
with the observed data, as follows: 



fiq^lfi, m H ,9(Li, obs)) c% 



(15) 



</„.. 



The ensemble includes randomizing both T> and Q. 

Here the distribution of q^ assumes that the data V as 
well as the global observables Q are treated as measured 
quantities, i.e., they fluctuate upon repetition of the ex- 
periment according to the model ftot^com, G\/J,, mn, 8). 

Upper limits for the strength parameter li are calcu- 
lated using the C'L S procedure [76{ . To calculate this 
limit, the quantity CL S is defined as the ratio 



CL s {p) 



(16) 



1 ~Pb 

where pb is the p- value derived from the same test statis- 
tic under the background-only hypothesis 

poo 

Pb = l- f(q tl \0,m H ,e(p = 0,obs))dq fl . (17) 
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The CL S upper limit on /j, is denoted ^ up and obtained 
by solving for CL s (ii np ) — 5%. A value of \x is regarded 
as excluded at the 95% confidence level if fx > /x up . 

The significance of an excess is based on the compat- 
ibility of the data with the background-only hypothesis. 
This compatibility is quantified by the following p- value: 



Po 



f(q o \0, m H , 00 = 0, obs))d<7o 



(18) 



'/(!., 



Note that po and pb are both p- values of the background- 
only hypothesis, but the test statistic qo in Eq. (|18p is op- 
timized for discovery while the test statistic in Eq. (|17l) 
is optimized for upper limits. 

It is customary to convert the background-only p- value 
into an equivalent Gaussian significance Z (often written 
Za). The conversion is defined as 



Z = $- 1 (l-p ) , 



(19) 



where $ 1 is the inverse of the cumulative distribution 
for a standard Gaussian. 



C. Experimental Sensitivity and Bands 

It is useful to quantify the experimental sensitivity by 
means of the significance one would expect to find if a 
given signal hypothesis were true. Similarly, the expected 
upper limit is the median upper limit one would expect 
to find if the background-only hypothesis were true. Al- 
though these are useful quantities, they are subject to a 
certain degree of ambiguity because the median values 
depend on the assumed values of all of the parameters of 
the model, including the nuisance parameters. 

Here, the expected upper limit is defined as the me- 
dian of the distribution /(/x up |0, mn , 0(/x = 0, obs)) and 
the expected significance is based on the median of the 

distribution f(po\l, ms, 9(fi — l,obs)). The expected 
limit and significance thus have a small residual depen- 
dence on the observed data through 0(/i,obs). 

These distributions are also used to define bands 
around the median upper limit. The standard presen- 
tation of upper limits includes the observed limit, the 
expected limit, a ±ler and a ±2<r band. More precisely, 
the edges of these bands, denoted /x up ±i and fi U p±2, are 
defined by 



Mup±JV 



f{fi up \O,m H ,0(n = 0,obs))d^ up = $(±JV) 



(20) 



D. Asymptotic Formalism 



For large data samples, the asymptotic distributions 
/(^mI/ 1 '' m H, &) and /(qoIm'j m ffi ®) are known and de- 
scribed in Ref. [7lj |. These formulae require the variance 



of the maximum likelihood estimate of /x given // is the 
true value: 



var[/i|/i 



(21) 



One result of Ref. 71] is that oy can be estimated with 
an artificial dataset referred to as the Asimov dataset. 
This dataset is defined as a binned dataset, where the 
number of events in bin b is exactly the number of events 
expected in bin b. The value of the test statistic evaluated 
on the Asimov data is denoted q^^A^ , where the subscript 
A a denotes that this is the Asimov data associated with 
a. A convenient way to estimate the variance of /t is 



¥■ - ¥■ 



(22) 



In the asymptotic limit, a fL is parabolic, thus ay is 
independent of /x. In Ref. [7l| . the bands around the 
expected limit are given by 



/iup+iv = a ($~ : (1-0.05 *(JV)) + JV) 



(23) 



An improved procedure is used here in order to cap- 
ture the leading deviations of q^ from a parabola, cor- 
responding to departures in the distribution of jx from a 
Gaussian distribution centered at u! with variance a,,,. 
Finding the upper limit /x up (still using the formulae 
in Ref. [7l|) for an Asimov dataset constructed with 

ol 7v = (hn, ttih, Oil^Ni obs)), where /xjv is the value of /x 
corresponding to the edges of the h up ±n band. It is found 
to be more accurate than Eq. (|23|) in reproducing the 
bands obtained with ensembles of pseudo-experiments. 
The value of hn used for the +N<r band is the value of 

fi that satisfies \J% N ,A = N. The choice of 0(/ijv,obs) 
is more indicative of 8 for the pseudo-experiments that 
have /x up near the corresponding band. 



E. Bayesian Methods 

A posterior distribution for the signal strength pa- 
rameter /i can be obtained, via Bayes's theorem, with 
ftot(f j G\ a ) and a prior T)(a). The information about the 
nuisance parameters from auxiliary measurements is in- 
corporated into i to t(T>, Q\ol) through the constraint terms 
f p (a p \a p ). As in Eq. ©, the prior on rj(a p ) is taken as a 
uniform distribution. The upper limits are calculated for 
a given value of run, so no prior on run is needed. The 
prior on the signal strength parameter /x is taken to be 
uniform as this choice leads to one-sided Bayesian cred- 
ible intervals that correspond numerically with the CL S 
upper limits in the frequentist formalism for the simple 
Gaussian and Poisson cases, and have been observed to 
coincide in more complex situations. The one-sided, 95% 
Bayesian credible region is defined as 



ftot(£>sim, G\v, m H, 0) T](fi)i](0) dfi d0 = 0.95 



(24) 
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The integration over nuisance parameters is carried out 
with Markov Chain Monte Carlo using the Metropolis- 
Hastings algorithm in the RooStats package (26|. 

F. Correction for the Look Elsewhere Effect 



the ±2(7 band. This procedure has been validated using 
ensemble tests and the Bayesian calculation of the exclu- 
sion limits with a uniform prior on the signal strength 
described in Section IV El These approaches yield limits 
on /i which typically agree with the asymptotic results 
within a few percent. 



By scanning over mu and repeatedly testing the 
background-only hypothesis, the procedure is subject 
to effects of multiple testing referred to as the look- 
elsewhere effect. In principle, the confidence intervals 
can be constructed in the fi — wih plane using the profile 
likelihood ratio A(/i, m#). For /i = there is no signal 
present, thus the model is independent of mu . This leads 
to a background-only distribution for —2 In A(0, mu) that 
departs from a chi-square distribution, thus complicating 
the calculation of a global po when mu is not specified. 
The global test statistic is the supremum of qo{mu) with 
respect to mu 



qo(m H ) = sup g (m H ) ■ 

m H 



(25) 



In the asymptotic regime and for very small p- values, 
a procedure [77{, based on the result of Ref. [78j], ex- 
ists to estimate the tail probability for qo(rriH)- The 
procedure requires an estimate for the average number 
of up-crossings of go( m ff) above some threshold. Due 
to the m^f-dependence in the model, changes in cuts, 
and the list of channels included, it is technically diffi- 
cult to estimate this quantity with ensembles of pseudo- 
experiments. Instead, a simple alternative method is 
used in which the average number of up-crossings is esti- 
mated by counting the number of up-crossings with the 
observed data. When the trials factor is large, the num- 
ber of up-crossings at low thresholds is also large and thus 
a satisfactory estimate of the average (25|. This proce- 
dure has been checked using a large number of pseudo- 
experiments in a simplified test case, where it provided a 
good estimate of the trials factor. 



VI. EXCLUSION LIMITS 

The model discussed in Section [TT| is used for all chan- 
nels described in Section IIIII and the systematic uncer- 
tainties summarized in Section IIVI The statistical meth- 
ods described in Section [V] are used to set limits on the 
signal strength as a function of mu- 

The expected and observed limits from the individual 
channels entering this combination are shown in Fig. [31 
The combined 95% CL exclusion limits on \x are shown 
in Fig. @] as a function of mn- These results are based on 
the asymptotic approximation. The ±1<7 and ±2a vari- 
ation bands around the median background expectation 
are calculated using the improved procedure described in 
Section IV D[ which yields slightly larger bands compared 
to those in Ref. [3. Typically the increase in the bands 
is of the order of ~ 5% for the ±1<t band and 10-15% for 
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FIG. 3. The observed (solid) and expected (dashed) 95% CL 
cross section upper limits for the individual search channels 
and the combination, normalized to the SM Higgs boson pro- 
duction cross section, as a function of the Higgs boson mass; 
(a) for the full Higgs boson mass hypotheses range and (b) 
in the low mass range. The expected limits are those for the 
background-only hypothesis i.e. in the absence of a Higgs 
boson signal. 

The expected 95% CL exclusion region for the SM 
(/j = 1) hypothesis covers the tuh range from 120 GeV 
to 560 GeV. The addition of the H -» r + r~ and H -> bb 
channels as well as the update of the H — > IFW^*) — > 
l^vi~v channel bring a significant gain in sensitivity in 
the low-mass region with respect to the previous com- 
bined search. For Higgs boson mass hypotheses below ap- 
proximately 122 GeV the dominant channel is H —¥ 77. 
For mass hypotheses larger than 122 GeV but smaller 
than 200 GeV the H -> WW^ ->■ t+vl~V channel is 
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FIG. 4. The observed (full line) and expected (dashed line) 
95% CL combined upper limits on the SM Higgs boson pro- 
duction cross section divided by the SM expectation as a func- 
tion of tub , (a) in the full mass range considered in this anal- 
ysis and (b) in the low mass range. The dotted curves show 
the median expected limit in the absence of a signal and the 
green and yellow bands indicate the corresponding ±lcr and 
±2<r intervals. 



ZZ -> £+£-W 
The updates of 
WW -> £v<g, 
— > £ + £~qq chan- 



the most sensitive. In the mass range between ~200 GeV 
and ~300 GeV the H -> ZZ^> -> £+£-£+£- dominates. 
For higher mass hypotheses the H - 
channel leads the search sensitivity, 
the H -> WW^ _> e + u£~V, H - 
H -> ZZ -> £+£-pv, and H ^ ZZ 
nels improve the sensitivity in the high-mass region. 

The observed exclusion regions range from 111.4 GeV 
to 116.6 GeV, from 119.4 GeV to 122.1 GeV, and from 
129.2 GeV to 541 GeV at 95% CL under the SM (fi = 1) 
hypothesis. The mass range 122.1 GeV to 129.2 GeV 
is not excluded due to the observation of an excess of 
events above the expected background. This excess and 
its significance are discussed in detail in Section FVlTl 

Two mass regions where the observed exclusion is 



stronger than expected can be seen in Fig. |4j In the low 
mass range, Higgs mass hypotheses in the 111.4 GeV to 
116.6 GeV range are excluded due mainly to a local deficit 
of events in the diphoton channel with respect to the ex- 
pected background. A similar deficit is observed in the 
high mass region in the range 360 GeV to 420 GeV, re- 
sulting from deficits in the H -> ZZ&> -> £+£-£+£' and 
H ZZ — > £ + £~vV channels. Both fluctuations corre- 
spond to approximately two standard deviations in the 
distribution of upper limits expected from background 
only. 

A small mass region near mn ~ 245 GeV was not ex- 
cluded at the 95% CL in the combined search of Ref. fl4j . 
mainly due to a slight excess in the H — > ZZ^> — > 
£ + £~£ + £~ channel. This mass region is now excluded. 
The CL S values for fi = 1 as a function of the Higgs bo- 
son are shown in Fig. [5j where it can also be seen that 
the region between 130.7 GeV and 506 GeV is excluded 
at the 99% CL. The observed exclusion covers a large 
part of the expected exclusion range. 



VII. SIGNIFICANCE OF THE EXCESS 

The observed local p-values, calculated using the 
asymptotic approximation, as a function of mn and the 
expected value in the presence of a SM Higgs boson signal 
at that mass are shown in Fig.[6]in the entire search mass 
range and in the low mass range. The asymptotic approx- 
imation has been verified using ensemble tests which yield 
numerically consistent results. 

The largest significance for the combination is observed 
for 771^=126 GeV, where it reaches 3. Oct with an expected 
value in the presence of a signal at that mass of 2.9a. The 
observed (expected) local significances for ma =126 GeV 
are 2.8ct (1-4ct) in the H — > 77 channel and 2.1ct (1-4ct) 
in the H -> ZZ& -> £+£-£+£- channel. In the H -> 
VF"[y(*) — > £+v£~v channel, which has been updated and 
includes additional data, the observed (expected) local 
significance for mjj=126 GeV is 0.8ct (1.9ct); the observed 
significance was previously 1.4ct [l4| . 

The significance of the excess is not very sensitive to 
energy scale and resolution systematic uncertainties for 
photons and electrons; however, the presence of these 
uncertainties leads to a small deviation from the asymp- 
totic approximation. The observed po including these 
effects is therefore estimated using ensemble tests. The 
results are displayed in Fig. [6] as a function of m#. The 
effect of the energy scale systematic uncertainties is an 
increase of approximately 30% of the corresponding local 
Pq. The maximum local significance decreases slightly to 
2.9ct. The muon momentum scale systematic uncertain- 
ties are smaller and therefore neglected. 

The global p-values for the largest excess depends on 
the range of m# and the channels considered. The global 
Po associated with a 2.8ct excess anywhere in the H — > 77 
search domain of 110-150 GeV is approximately 7%. A 
2.1a excess anywhere in the H -> ZZ& -t £ + £~£+£- 
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FIG. 5. The value of the combined CL S for fi — 1 (testing 
the SM Higgs boson hypothesis) as a function of run, (a) in 
the full mass range of this analysis and (b) in the low mass 
range. The regions with CL a < a are excluded at the (1 — a) 
CL. 

search range of 110-600 GeV corresponds to a global po 
of approximately 30%. 

The global probability for a 2.9a excess in the com- 
bined search to occur anywhere in the mass range 110— 
600 GeV is estimated to be approximately 15%, decreas- 
ing to 5-7% in the range 110-146 GeV, which is not ex- 
cluded at the 99% CL by the LHC combined SM Higgs 
boson search [g^] • The data are observed to be consistent 
with the background-only hypothesis except for the re- 
gion around tuh — 126 GeV. The observed and expected 
ratio —2 ln(A(l)/A(0)) is shown in Fig. [3 which indicates 
a departure from the background-only hypothesis similar 
to the signal-plus-background expectation. 



5 1 








ni 
o 








o 
_i 






1o 


10- 1 




f — Obs. Asymptotic w/o ESSE 
f — Exp. Asymptotic w/o ESS: 
/ -s- Obs. Ensemble w/ ESS Z 


2a 


icr 2 


z ATLAS 2011 1 


j J*Ldt = 4.6-4.9 fb" 1 1 
U /s=7TeV 


3a 


1(T 3 




,,,,'«,,,, 1 T ,,,, 1 ,,,, 1 ,, 





110 115 120 125 130 135 140 145 150 

m H [GeV] 

(c) 



FIG. 6. The local probability po for a background-only ex- 
periment to be more signal-like than the observation, for in- 
dividual channels and the combination, (a) In the full mass 
range of 110-600 GeV and (b) in the low mass range of 110- 
150 GeV. The full curves give the observed individual and 
combined po- The dashed curves show the median expected 
value under the hypothesis of a SM Higgs boson signal at 
that mass. The combined observed local po estimated using 
ensemble tests and taking into account energy scale system- 
atic (ESS) uncertainties is illustrated in (c), the observed and 
expected combined results using asymptotic formulae is also 
shown. The horizontal dashed lines indicate the po corre- 
sponding to significances of la, 2a, and 3<r for (a), (b) and 
(c). 
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FIG. 7. The ratio of profile likelihoods for /j, = and /i = 1 
as a function of the Higgs boson mass hypothesis. The full 
line shows the observed ratio, the lower dashed line shows 
the median value expected under the signal-plus-background 
hypothesis, and the upper dashed line shows the median ex- 
pected under the background-only hypothesis (a) for the full 
mass range and (b) the low mass range. The ±l<r and ±2<r 
intervals around the median background-only expectation are 
given by the green and yellow bands, respectively. 

The best-fit value of /i, denoted p,, is displayed for the 
combination of all channels in Fig. |8] and for individual 
channels in Fig. |9] as a function of the m# hypothesis. 
A summary of — 21nA(/tt) < 1 intervals at three specific 
Higgs boson mass hypotheses (toh=119 GeV, 126 GeV 
and 130 GeV) for each Higgs decay mode and the com- 
bination is given in Fig. 1101 The bands around ft il- 
lustrate the fj, interval corresponding to — 21nA(/i) < 1 
and represent an approximate ±1<7 variation. While the 
estimator p is allowed to be negative in Figs. [5] and [§] 
in order to illustrate the presence and extent of down- 
ward fluctuations, the /j, parameter is bounded to ensure 
non-negative values of the probability density functions 
in the individual channels. Hence, for negative p values 
close to the boundary, the —2 In \{fx) < 1 region does not 
reflect a calibrated 68% confidence interval. It should be 



noted that the p does not directly provide information 
on the relative strength of the production modes. The 
excess observed for win = 126 GeV corresponds to a p 
of 1.1 ±0.4, which is compatible with the signal strength 
expected from a SM Higgs boson at that mass (/x = 1). 
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FIG. 8. The combined best-fit signal strength /i as a function 
of the Higgs boson mass hypothesis (a) in the full mass range 
of this analysis and (b) in the low mass range. The interval 
around jX corresponds to a variation of — 21nA(/i) < 1. 



VIII. CONCLUSION 

A combined search for the Standard Model Higgs bo- 
son has been performed with the ATLAS detector in the 
y/s = 7 TeV pp collision data collected in 2011, corre- 
sponding to an integrated luminosity of 4.6-4.9 fb -1 . The 
channels used in this combination are H — > 77, H — > bb, 
H -> t+t-, H -> ZZ& -> £+£-£+£-, H -> ZZ -> 
l + £-qq, H -> ZZ -> l+l-yv, H -> WW^ -> £ + u£~v, 
and H — > WW — > Ivqq' . The observed exclusion ranges 
at the 95% CL are 111.4 GeV to 116.6 GeV, 119.4 GeV to 
122.1 GeV, and 129.2 GeV to 541 GeV, while Higgs bo- 
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FIG. 9. The best-fit signal strength /j, as a function of the Higgs boson mass hypothesis for the channels: (a) H — > 77, (b) 
H -> ZZ M -> £ + £~£ + £~ in the low mass region, (c) # -> ZZ W -> £ + £~£ + £~ across the full search range, (d) H -> -> 
e+l'UV, (e) H -> -> i + £~qq, (f) i? -> ZZ W for all sub-channels across the full search range, (g) H -> ff^'*' -» tvl'V 
in the low mass region, > across the full search range, (i) H -> WW -> ft/gf , (j) H -> for 

all sub-channels in the full mass range, (k) H — s> tt, and (1) // — >• feb. The band shows the interval around fi corresponding to 
a variation of — 21nA(/i) < 1. 



son masses between 120 GeV and 560 GeV are expected 
to be excluded at the 95% CL. The mass region between 
130.7 GeV and 506 GeV is excluded at the 99% CL. 

The local significance of the observed excess when all 



channels arc combined is 2.9a, in good agreement with 
the expected significance in the presence of a SM Higgs 
boson with m#=126 GeV of 2.9er. An estimate of the 
global probability for such an excess to occur anywhere in 
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FIG. 10. Summary of the individual and combined best-fit 
values of the strength parameter for three sample Higgs boson 
mass hypotheses 119 GeV, 126 GeV (where the maximum 
observed significance is reached) and 130 GeV. 



the full explored Higgs boson mass region (from 110 GeV 
to 600 GeV) is approximately 15%. The global po in 
the range not excluded at the 99% CL by the LHC com- 
bined Higgs boson search results [62j (from 110 GeV to 
146 GeV) is approximately 5-7%. 
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